Discovering Generalized Episodes Using Minimal Occurrences
Abstract
Sequences of events are an important special form of data that arises in several contexts, including telecommunications, user interface studies, and epidemiology. We present a general and flexible framework for specifying classes of generalized episodes: recurrent combinations of events that satisfy certain conditions. The framework can be instantiated for a wide variety of applications by selecting suitable primitive conditions. We present algorithms for discovering frequently occurring episodes and episode rules. The algorithms are based on minimal occurrences of episodes, which makes it possible to evaluate the confidences of a wide variety of rules in a single analysis pass. We present empirical results on the behavior of the algorithms on events stemming from a WWW log.
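To make the notion of a minimal occurrence concrete, here is a minimal Python sketch. It is not the paper's algorithm. It assumes the simplest case, a serial episode (an ordered list of event types), an event sequence with strictly increasing timestamps, and closed intervals: an interval is a minimal occurrence if the episode occurs inside it and inside no proper subinterval. The function name `minimal_occurrences` and the `(timestamp, type)` event representation are illustrative choices, not taken from the paper.

```python
from typing import List, Sequence, Tuple

Event = Tuple[int, str]  # (timestamp, event type); timestamps assumed strictly increasing


def minimal_occurrences(events: Sequence[Event],
                        episode: Sequence[str]) -> List[Tuple[int, int]]:
    """Return the (start_time, end_time) intervals that are minimal
    occurrences of a serial episode (illustrative sketch only)."""
    if not episode:
        return []
    result: List[Tuple[int, int]] = []
    last_start = None  # start index of the previous candidate window
    for end in range(len(events)):
        # A minimal window must end on the episode's last event type.
        if events[end][1] != episode[-1]:
            continue
        # Scan backwards, matching the remaining types as late as possible;
        # this yields the latest start for an occurrence ending at `end`.
        k = len(episode) - 2
        start = end
        j = end - 1
        while k >= 0 and j >= 0:
            if events[j][1] == episode[k]:
                start = j
                k -= 1
            j -= 1
        if k >= 0:
            continue  # no complete occurrence ends at this event
        # Latest starts never decrease as `end` grows, so the window is
        # minimal unless an earlier end already produced the same start.
        if start != last_start:
            result.append((events[start][0], events[end][0]))
        last_start = start
    return result


log = [(1, "A"), (2, "B"), (3, "A"), (4, "C"), (5, "B"), (6, "C")]
print(minimal_occurrences(log, ["A", "B"]))  # [(1, 2), (3, 5)]
```

The backward scan keeps the sketch short at the cost of quadratic worst-case time. Once the minimal occurrences are listed, counts for different window widths can be read off the same list, which is one way to see why a single pass can support many rule confidences.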
Related papers
Algorithms to Discover Complete Frequent Episodes in Sequences
A serial episode is a type of temporal frequent pattern in sequence data. In this paper we compare the performance of serial episode discovery algorithms. Many different algorithms have been proposed to discover different types of episodes for different applications. However, it is unclear which algorithm is more efficient for discovering different types of episodes. We compare Minepi and WinMi...
Inferring Neuronal Network Connectivity using Time-constrained Episodes
Discovering frequent episodes in event sequences is an interesting data mining task. In this paper, we argue that this framework is very effective for analyzing multi-neuronal spike train data. Analyzing spike train data is an important problem in neuroscience though there are no data mining approaches reported for this. Motivated by this application, we introduce different temporal constraints...
Mining Closed Episodes from Event Sequences Efficiently
Recent studies have proposed different methods for mining frequent episodes. In this work, we study the problem of mining closed episodes based on minimal occurrences. We study the properties of minimal occurrences and design effective pruning techniques to prune non-closed episodes. An efficient mining algorithm Clo_episode is proposed to mine all closed episodes following a breadth-first sear...
Discovering Associations between Climatic and Oceanic Parameters to Monitor Drought in Nebraska Using Data-Mining Techniques
Drought is a complex natural hazard that is best characterized by multiple climatological and hydrological parameters. Improving our understanding of the relationships between these parameters is necessary to reduce the impacts of drought. Data mining is a recently developed technique that can be used to interact with large databases and assist in the discovery of associations between drought a...
Discovering Neuronal Connectivity from Serial Patterns in Spike Train Data
Repeating patterns of precisely-timed activity across a group of neurons (called frequent episodes) are indicative of networks in the underlying neural tissue. This paper develops statistical methods to determine functional connectivity among neurons based on “non-overlapping” occurrences of episodes. We study the distribution of episode counts and develop a two-phase strategy for identifying f...
Publication date: 1996